Pixel-based versus object-based identification of scenic resources using Gaofen-2 images: A case study of Yesanpo National Park

Scenic resources can serve as symbols of a region’s natural resources and culture and are often the stimulus for the development of national parks. Thus, careful scientific planning and effective management based on the identification and evaluation of scenic resources are key for the sustainable development of national parks. In this study, one object-oriented and three pixel-based (maximum likelihood classification, neural network, and support vector machine) classification methods were applied to identify scenic resources in Yesanpo National Park using high-resolution Gaofen-2 images. The classification accuracy of these scenic resources was evaluated through systematic sampling, which improved the objectivity and accuracy of the classification precision evaluation. All methods met the precision requirements of scenic resource identification, and the accuracy of object-oriented classification was the highest. The application scope of the different methods varies, and suitability can be determined according to the needs of scenic resource recognition. Collectively, this study has proposed an effective and practical method for the identification of scenic resources within Yesanpo National Park, which is of significance for its future planning and management. Moreover, this strategy can be applied by other national park planners to select areas for tourism development, formulate sustainable development strategies, and provide technical support and decision-making guidance for national park planning and management.


Introduction
National parks have ornamental, cultural, and scientific value. The natural and cultural landscapes in these areas are relatively concentrated, thereby allowing people to readily frequent the areas to conduct scientific, recreational, and cultural activities. Moreover, national parks have an important role in protecting ecology, biodiversity, natural environment, and cultural heritage, while also contributing to the development of tourism, carrying out of scientific research and cultural education activities, and promoting the sustainable development of local economy and society [1,2]. Indeed, leisure tourism and cultural tourism within national parks represent the fastest growing types of tourism. As such, strategic planning and tourism development have become the focus of research related to national parks. Specifically, the accurate Thus, substantial research has been performed to identify individual scenic resources, such as forests, vegetation, and landforms, whereas there is a dearth of research on the identification and evaluation of scenic resource systems. Specifically, no study has reported the construction and application of a scenic resource identification and evaluation system based on GF-2 images combined with 3S technology. This also reflects the limitations of the current research and application of the scenic resource identification system. In fact, researchers, planners, and management personnel often fail to conduct comprehensive and detailed investigations on national parks and instead conduct fieldwork on only scenic resources within certain areas of the national park. Moreover, the use of manual surveys, document collection, and manual visual recognition to guide the planning, protection, and development of scenic resources will inevitably have challenges, including insufficient realism and poor operability. 
Therefore, the development of strategies that facilitate the comprehensive, efficient, and accurate identification of scenic resources has become increasingly important.
In this study, the Yesanpo National Park was selected as the study area and GF-2 images were used as the data source to determine the classification method most suitable for identifying scenic resources. eCognition was then employed to determine the optimal segmentation scale of GF-2 images based on the characteristics of the image and the scenic resources to be identified. The decision tree algorithm was used to perform object-oriented classification, determine the identification rules and feature parameter combinations, and identify the scenic resources. Additionally, pixel-based MLC, SVM, and NN classification methods were applied to identify scenic resources. The systematic sampling points were then used to calculate the confusion matrix and kappa coefficient to evaluate the classification accuracy of the four classification methods. Finally, the applicable scope of object-oriented and pixel-based classification methods was assessed to determine which method was most suitable for the identification of scenic resources in GF-2 images. These findings will guide natural national park personnel in the efficient and accurate identification of scenic resources to facilitate the selection of suitable areas for the development of leisure tourism and cultural tourism, promote the scientific protection and development of scenic resources, formulate sustainable development strategies for the national park, and improve regional competitiveness.
This study combined global positioning system (GPS) field measurements, RS image recognition, and geographic information system (GIS) spatial analysis with multiple data sources (GF-2 images and forest resource planning and design survey data) to establish a comprehensive, technically demanding, multi-platform approach for identifying and evaluating scenic resources. Research on scenic resources remains limited; therefore, this study provides a theoretical basis and sound technical support for the systematic, high-precision, digital identification and evaluation of scenic resources.

Study area
Yesanpo National Park covers a total area of 505.48 km² and is located in the northwest region of Laishui County, near Baoding City, Hebei Province, China, bordering the Fangshan District, Beijing. Located in the deep mountain region at the eastern foot of the northern Taihang Mountains and southern foot of the western Yanshan Mountains, the park is designated as a world geopark, National Park of China, National AAAAA-level Tourist Area, National Forest Park, and National Ecological Tourism Demonstration Area. The park is located on the "step" of the North China Plain, leading up to the Shanxi Plateau, comprising a unique combination of geological features including an alluvial valley, granite fracture structures, a waterfall and canyon, and a karst-cave spring landscape, collectively formed under the action of external geological forces [35,36]. Yesanpo was one of the first national parks in China to develop tourism. Its scenic resources are highly typical and representative of Northern China and have driven the economic development of the national park and the surrounding areas since the development of tourism in this area after 1986. However, the contradiction between development and protection of the scenic resources has become increasingly prominent with the rapid increase of socio-economic development; therefore, it is crucial to identify the quantity, distribution, scale, and combination of scenic resources in the park accurately and comprehensively, as well as formulate a basis for reasonable and effective scenic area planning and promote the sustainable use of the scenic resources in the area.
According to relevant data from the national park, three typical landform units in the area were selected as representative study sites: the Baili Gorge, an erosional, alluvial landform in the Roach Valley; Longmen Tianguan, a granite fracture structure canyon landform; and Yugu Cave, a karst-cave spring landform [37]. Within the national park, the Baili Gorge is located in the southwest with a total area of 154.0 km², the Longmen Tianguan is located in the northwest with a total area of 66.2 km², and the Yugu Cave is located in the middle with a total area of 57.5 km².

Data preprocessing
GF-2 image data of Yesanpo National Park acquired on April 16, 2018 were purchased for this study. The RS image data were preprocessed in ENVI and ArcGIS, including ortho-correction, image fusion (Gram-Schmidt method), geometric correction, and mosaicking and cropping [38], to reduce image errors and improve the initial accuracy and resolution. Ortho-correction was used to address obvious geometric distortions caused by terrain, camera geometry, and sensor-related errors. Geometric correction was then used to eliminate the remaining geometric distortion and produce a new image that meets the requirements of the chosen map projection or graphic expression, as shown in Fig 1. Through visual interpretation and field investigation, we obtained a priori knowledge of the characteristics of scenic resources in sample areas of the GF-2 images, which guided our selection of training samples for each category. According to the Jeffries-Matusita test, the pairwise sample separability in the GF-2 images ranged from 1.8030 to 2.0000. All values exceed 1.8, indicating that the samples differ significantly, are well separated, and can be used as training samples for classifying scenic resources.
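The separability check described above can be illustrated with a short sketch. It computes the Jeffries-Matusita (JM) distance between two classes under the usual assumption that each class is approximately Gaussian in spectral space. The function name, band means, and sample values are illustrative assumptions rather than the study's training data, and ENVI's own implementation may differ in detail.

```python
import numpy as np

def jeffries_matusita(class_a, class_b):
    """JM distance between two training classes.

    class_a, class_b: arrays of shape (n_samples, n_bands).
    Assumes each class is roughly Gaussian in spectral space.
    """
    mean_a, mean_b = class_a.mean(axis=0), class_b.mean(axis=0)
    cov_a = np.atleast_2d(np.cov(class_a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(class_b, rowvar=False))
    cov_avg = (cov_a + cov_b) / 2.0
    diff = mean_a - mean_b

    # Bhattacharyya distance between the two Gaussian class models.
    mean_term = 0.125 * diff @ np.linalg.solve(cov_avg, diff)
    logdet_avg = np.linalg.slogdet(cov_avg)[1]
    logdet_a = np.linalg.slogdet(cov_a)[1]
    logdet_b = np.linalg.slogdet(cov_b)[1]
    cov_term = 0.5 * (logdet_avg - 0.5 * (logdet_a + logdet_b))
    bhattacharyya = mean_term + cov_term

    # JM rescales the distance to the range [0, 2].
    return float(2.0 * (1.0 - np.exp(-bhattacharyya)))

# Synthetic 4-band samples (made-up values, not the study's training data).
rng = np.random.default_rng(0)
forest = rng.normal([0.05, 0.08, 0.04, 0.40], 0.01, size=(200, 4))
water = rng.normal([0.06, 0.07, 0.05, 0.03], 0.01, size=(200, 4))
jm = jeffries_matusita(forest, water)
print(f"JM = {jm:.4f}, usable as training samples = {jm > 1.8}")
```

Values close to 2 indicate nearly complete separation, whereas values well below 1.8 would suggest that two categories should be re-sampled or merged before classification.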

Object-oriented method for scenic resource recognition
Selection of the optimal segmentation scale. Object-oriented scenic resource classification was performed on the GF-2 images using eCognition software; its basis and key technology is RS image segmentation. The segmentation parameters set the maximum heterogeneity allowed within image objects in the national park and therefore directly affect the accuracy of the digital identification of scenic resources. Accordingly, the selection of segmentation parameters must consider both the spatial resolution of the GF-2 image data and the attributes of the scenic resources in the image. An excessively large segmentation scale tends to cause under-segmentation, in which different scenic resources are merged into a single object, and thus low classification accuracy; conversely, a segmentation scale that is too small leads to over-segmentation (fragmented objects), longer running times, and lower efficiency [39]. Therefore, determining the optimal segmentation scale for GF-2 image data for the national park is a priority in research on object-oriented classification.
To select the best segmentation parameters, several experiments on the segmentation scale were conducted based on the spatial resolution of the GF-2 image data and the attributes of the five types of scenic resources (forest, grassland/shrub, farmland, architecture, and water). Six typical segmentation scales between 20 and 200 (20, 50, 100, 120, 150, and 200) were selected for comparison (shape = 0.1, compactness = 0.5). The segmentation results at different scales for an area containing multiple types of scenic resources are shown in Fig 2, which shows that the segmentation effect varies with segmentation scale. The following observations were noted:
1. At a segmentation scale of 20 (Fig 2A), the image is segmented too finely, producing too many objects; the scenic resources are decomposed into multiple polygons, such that only some objects are segmented clearly, with regular shapes and obvious boundaries between scenic resources.
2. At a segmentation scale of 50 (Fig 2B), there were far fewer segmented image objects than at the previous scale. Larger scenic resources such as forest, grassland/shrub, and architecture are segmented into multiple polygons, and forest and grassland/shrub are more fragmented, yet also more clearly demarcated from architecture and water.
Based on the object-oriented segmentation evaluation methods proposed by Corcoran [40] and Zhang [41], an unsupervised segmentation evaluation method based on a heterogeneity measure was used to evaluate the segmentation of the GF-2 images. At a segmentation scale of 100, the measure reaches its minimum value of 0.8416, indicating the best segmentation effect. In summary, upon comparing the segmentation results of GF-2 images of the national park, the following settings were selected as optimal: segmentation scale of 100, shape = 0.1, and compactness = 0.5. These settings were determined suitable for the identification and extraction of five types of scenic resources in the national park for this study: forest, grassland/shrub, farmland, architecture, and water.
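The idea of scoring candidate segmentation scales without reference data can be sketched as follows. The exact measure of [40,41] is not reproduced in the text, so this example swaps in a widely used stand-in: area-weighted within-segment variance (objects should be internally homogeneous) combined with Moran's I of neighbouring segment means (adjacent objects should differ). Note that this stand-in is maximised, whereas the study reports the minimum of its own measure. All function names and the synthetic 8 × 8 image are assumptions for illustration only.

```python
import numpy as np

def segment_stats(image, labels):
    """Return (area-weighted within-segment variance, segment means, index map)."""
    _, inv, counts = np.unique(labels, return_inverse=True, return_counts=True)
    idx = inv.reshape(labels.shape)
    vals = image.astype(float).ravel()
    means = np.bincount(idx.ravel(), weights=vals) / counts
    sq_means = np.bincount(idx.ravel(), weights=vals ** 2) / counts
    variances = sq_means - means ** 2
    return float((counts * variances).sum() / counts.sum()), means, idx

def morans_i(means, idx):
    """Moran's I of segment means over 4-neighbour segment adjacency."""
    pairs = np.concatenate([
        np.stack([idx[:, :-1].ravel(), idx[:, 1:].ravel()], axis=1),
        np.stack([idx[:-1, :].ravel(), idx[1:, :].ravel()], axis=1),
    ])
    pairs = pairs[pairs[:, 0] != pairs[:, 1]]
    edges = np.unique(np.sort(pairs, axis=1), axis=0)  # undirected, de-duplicated
    dev = means - means.mean()
    denom = (dev ** 2).sum()
    if len(edges) == 0 or denom == 0:
        return 0.0
    num = (dev[edges[:, 0]] * dev[edges[:, 1]]).sum()
    return float(len(means) / len(edges) * num / denom)

def best_segmentation(image, candidates):
    """Pick the candidate with the highest normalised variance + autocorrelation score."""
    names = list(candidates)
    stats = [segment_stats(image, candidates[n]) for n in names]
    var = np.array([s[0] for s in stats])
    moran = np.array([morans_i(s[1], s[2]) for s in stats])

    def norm(x):
        # Rescale so that the lowest raw value scores 1 and the highest scores 0.
        span = x.max() - x.min()
        return np.ones_like(x) if span == 0 else (x.max() - x) / span

    score = norm(var) + norm(moran)
    return names[int(np.argmax(score))], dict(zip(names, score))

# Synthetic single-band image with four homogeneous quadrants (made-up values).
image = np.kron(np.array([[0, 10], [20, 30]]), np.ones((4, 4)))
candidates = {
    "too_fine": np.kron(np.arange(16).reshape(4, 4), np.ones((2, 2), dtype=int)),
    "matched": np.kron(np.arange(4).reshape(2, 2), np.ones((4, 4), dtype=int)),
    "too_coarse": np.kron(np.array([[0, 1]]), np.ones((8, 4), dtype=int)),
}
best, scores = best_segmentation(image, candidates)
print(best, scores)
```

In this toy case the overly fine segmentation is penalised because neighbouring objects are alike, and the overly coarse one because objects mix different values, which mirrors the trade-off described for the scale parameter.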
PLOS ONE

Classification of image data based on optimal segmentation scale. The decision tree algorithm was used for object-oriented classification of the national park to establish recognition rules to decompose the attributes of different scenic resources in a step-wise manner. By analyzing and comparing the data of each waveband of the GF-2 images, the recognition rules were constructed by analyzing the relationship between the attributes of the five types of scenic resources in the scenic resource identification and evaluation system, as shown in Fig 3. The classification features were initially determined according to the recognition rules, and the results are shown in Table 1. On this basis, a combination of feature parameters was then determined, as shown in Fig 4. The established recognition rules were followed to classify the five types of scenic resources: forest, grassland/shrub, farmland, architecture, and water. The resulting scenic resource recognition effect after segmentation is shown in Fig 5. The features of mountain shadows in RS images are similar to those of trees and water. Thus, to improve the classification accuracy of forests and water during identification, the images were revised by manual visual interpretation.
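A step-wise rule set of this kind can be sketched as a small decision tree over per-object features. The actual rules and feature parameters are given in Fig 3, Table 1, and Fig 4, which are not reproduced here, so every feature name and threshold below is a hypothetical placeholder chosen only to make the sketch runnable.

```python
from dataclasses import dataclass

@dataclass
class ImageObject:
    """Per-object features; all names are hypothetical stand-ins for Table 1."""
    ndwi: float            # water index
    ndvi: float            # vegetation index
    brightness: float      # mean band value, scaled to 0-1
    texture: float         # canopy roughness proxy, scaled to 0-1
    rectangularity: float  # shape regularity, scaled to 0-1

# Hypothetical thresholds, not the values used in the study.
T_WATER, T_VEG, T_BRIGHT, T_TEXTURE, T_RECT = 0.3, 0.3, 0.5, 0.6, 0.7

def classify(obj: ImageObject) -> str:
    """Split off one scenic resource type per step, from easiest to hardest."""
    if obj.ndwi > T_WATER:
        return "water"
    if obj.ndvi < T_VEG:
        # Non-vegetated objects: bright roofs and roads versus bare fields.
        return "architecture" if obj.brightness > T_BRIGHT else "farmland"
    if obj.texture > T_TEXTURE:
        return "forest"
    if obj.rectangularity > T_RECT:
        # Vegetated but regular in shape: cultivated plots.
        return "farmland"
    return "grassland/shrub"

river = ImageObject(ndwi=0.6, ndvi=0.0, brightness=0.2, texture=0.1, rectangularity=0.3)
village = ImageObject(ndwi=0.0, ndvi=0.1, brightness=0.8, texture=0.4, rectangularity=0.8)
woodland = ImageObject(ndwi=0.0, ndvi=0.7, brightness=0.3, texture=0.8, rectangularity=0.2)
slope = ImageObject(ndwi=0.0, ndvi=0.5, brightness=0.4, texture=0.3, rectangularity=0.2)
print([classify(o) for o in (river, village, woodland, slope)])
```

Because each rule operates on an object rather than a pixel, shape and texture features can take part in the decision, which is what distinguishes this approach from the pixel-based classifiers.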

Table 1. Initial classification characteristics of scenic resources (columns: identifying information; features).

Methods for evaluating the classification accuracy of different landform units
Often, differences occur between the classification results of RS images and the actual situation; thus, an accurate evaluation of the classification results is needed to determine the accuracy and reliability of the classification. In this study, a confusion matrix was used for accuracy evaluation [42], and its indices are listed in Table 2. The kappa coefficient was calculated in the range of 0-1 [43], and the values were generally divided into five groups to indicate the level of consistency: 0.0-0.2 indicates slight consistency; 0.2-0.4 indicates fair consistency; 0.4-0.6 indicates moderate consistency; 0.6-0.8 indicates substantial consistency; and 0.8-1 indicates almost perfect consistency [44]. Systematic sampling was then used to extract the classification results corresponding to each classification method to verify and evaluate the accuracy and differences between pixel-based and object-oriented classification. This was carried out based on the 2018 forest resource planning and design survey data of Yesanpo National Park combined with the vector data of land use raster maps and field surveys from the Yesanpo National Park 2017-2030 master plan; the survey data were revised, and their attributes were merged into the five categories used in this study (forest, grassland/shrub, farmland, architecture, and water). The actual scenic resource type corresponding to each sampling point was then extracted from the corrected survey data using the ArcGIS multi-value point extraction function as the true value test sample, with the sampling interval set to 100 × 100 m to effectively represent all classes in the national park. In total, 27,691 sampling points were obtained, including 12,329 for forest, 12,866 for grassland/shrub, 1,496 for farmland, 804 for architecture, and 196 for water. Within the target scenic spots, Baili Gorge had 15,345 sample points, Longmen Tianguan had 6,575, and Yugu Cave had 5,771. All three scenic spots contained the above five types of scenic resources, and even the least-represented scenic resource type had more than 10 samples.
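The accuracy indices described above can be computed from a confusion matrix as in the following sketch. The matrix values are invented for illustration, the function names are assumptions, and the handling of values that fall exactly on a boundary between consistency levels (for example, 0.8) is a choice made here because the ranges quoted in the text overlap at their endpoints. The per-class producer's accuracy appears to correspond to what the text calls mapping accuracy, although this is not stated explicitly.

```python
def accuracy_metrics(matrix):
    """matrix[i][j] = number of samples of reference class i classified as class j."""
    k = len(matrix)
    n = sum(sum(row) for row in matrix)
    row_tot = [sum(row) for row in matrix]
    col_tot = [sum(matrix[i][j] for i in range(k)) for j in range(k)]

    observed = sum(matrix[i][i] for i in range(k)) / n               # overall accuracy
    expected = sum(r * c for r, c in zip(row_tot, col_tot)) / n ** 2  # chance agreement
    kappa = (observed - expected) / (1 - expected)

    producers = [matrix[i][i] / row_tot[i] for i in range(k)]  # per reference class
    users = [matrix[j][j] / col_tot[j] for j in range(k)]      # per mapped class
    return observed, kappa, producers, users

def consistency_level(kappa):
    """Five-level interpretation; upper bounds are treated as inclusive (assumption)."""
    for upper, label in [(0.2, "slight"), (0.4, "fair"),
                         (0.6, "moderate"), (0.8, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"

# Made-up two-class example (rows: reference, columns: classified).
example = [[50, 10],
           [5, 35]]
overall, kappa, producers, users = accuracy_metrics(example)
print(f"overall = {overall:.2%}, kappa = {kappa:.4f} ({consistency_level(kappa)})")
```

Applying the same function to the confusion matrix of each classifier, built from the same systematic sampling points, keeps the comparison between methods on an equal footing.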

Accuracy evaluation of object-oriented scenic resource classification
Accuracy evaluation of object-oriented classification of Baili Gorge. The results of the object-oriented classification of Baili Gorge are shown in Fig 6, the corresponding confusion matrix is shown in Table 3, and the overall classification accuracy of the area is shown in Table 4. Table 4 shows that the overall accuracy of object-oriented classification of Baili Gorge is high (86.20%). The kappa coefficient was 0.77 (between 0.6 and 0.8), indicating substantial consistency. The classification results for the scenic resource types were as follows:
1. Forest: Primarily misclassified as grassland/shrub (9.42%) and farmland (3.99%) because forest is easily confused with grassland/shrub. In addition, several scattered forests around the farmland show small differences and are therefore easily misclassified. The final classification area of forests was smaller than the actual area with a mapping accuracy of 85.90%.
2. Grassland/shrub: Primarily misclassified as forest (5.20%), farmland (3.21%), and architecture (1.63%) because some grassland/shrub areas are mixed with forests. Crops in some farmlands are similar to grassland/shrub vegetation and can, therefore, be easily misclassified. In addition, as the architecture in this area mainly comprises rural settlements and tourist facilities, vegetated areas of grassland/shrub around architecture can be misclassified. The final classification area of grassland/shrub was larger than the actual area with a mapping accuracy of 89.68%.
For Longmen Tianguan, the confusion matrix is shown in Table 5, and the classification accuracy is shown in Table 6. Table 6 shows that the overall accuracy of object-oriented classification of Longmen Tianguan is high (86.46%). The kappa coefficient was 0.77 (between 0.6 and 0.8), indicating substantial consistency. The classification results for the five types of scenic resources were as follows:
1. Forest: Mainly misclassified as grassland/shrub (10.31%), farmland (7.73%), and architecture (1.17%) because forest is easily confused with grassland/shrub. Furthermore, the presence of several scattered forests around farmlands and architectural structures, having small differences, results in misclassification. The final classification area of forests was smaller than the actual area with a mapping accuracy of 80.75%.
2. Grassland/shrub: Mainly misclassified as forest (3.85%), farmland (2.81%), and architecture (0.95%) because some grassland/shrub areas are mixed with forest. The characteristics of the crops planted in the farmland during spring are similar to those of grassland/shrub, which also causes misclassification. In addition, some grassland/shrub is mistakenly classified as architecture because the architecture in this area mainly comprises rural settlements and tourist facilities, and numerous vegetation areas occur around some architectural structures. The final classification area of grassland/shrub was larger than the actual area with a mapping accuracy of 92.40%.
3. Farmland: Mainly misclassified as architecture (12.74%), grassland/shrub (8.60%), and forest (4.78%) because of the presence of numerous farmlands around some architectural structures that could cause misclassification. The characteristics of crops planted in farmlands during spring are similar to those of plants in grasslands/shrubs and can, therefore, be easily misclassified. In addition, several scattered forests having small differences exist around farmlands, facilitating misclassification. The final classification area of farmlands was larger than the actual area with a mapping accuracy of 73.89%.
4. Architecture: Primarily misclassified as farmland (6.96%), grassland/shrub (6.09%), and forest (4.35%) because there are several farmlands, grasslands/shrubs, and forests around the existing architecture. The final architecture classification area was larger than the actual area with a mapping accuracy of 82.61%.

5. Water: Misclassified as architecture (16.67%) and forest (8.33%) because pond stems and riverbanks are assigned to architecture and forest. Because the RS data were acquired during the dry season (April), some river beaches are mistakenly classified as architecture. The final classification area of the water was smaller than the actual area with a mapping accuracy of 75.00%.

Accuracy evaluation of object-oriented classification of Yugu Cave
The results of the object-oriented classification of Yugu Cave are shown in Fig 8, the confusion matrix is shown in Table 7, and the classification accuracy is shown in Table 8. Table 8 shows that the overall accuracy of object-oriented classification of Yugu Cave is high (89.08%). The kappa coefficient was 0.82 (between 0.8 and 1), indicating almost perfect consistency. The classification results for the five types of scenic resources were as follows:
1. Forest: Misclassified as grassland/shrub (3.99%) and farmland (1.67%) because forest is easily confused with grassland/shrub. Furthermore, several scattered forests having small differences exist around the farmlands, thereby causing misclassification. The final classification area of forests was larger than the actual area with a mapping accuracy of 93.60%.
2. Grassland/shrub: Misclassified as forest (6.28%), farmland (3.12%), and architecture (1.10%) because some grassland/shrub areas are mixed with forests, thereby causing misclassification. The characteristics of crops planted in the farmlands during spring are similar to those in grasslands/shrubs and can, therefore, be easily misclassified.
3. Farmland: The characteristics of crops planted in the farmlands during spring are similar to those in grasslands/shrubs and can, therefore, be easily misclassified. In addition, several scattered forests having small differences exist around the farmland, resulting in their misclassification. The final classification area of farmlands was larger than the actual area with a mapping accuracy of 74.40%.
4. Architecture: Misclassified as forest (19.54%), farmland (12.64%), and grassland/shrub (8.61%) because of the presence of several farmlands, grasslands/shrubs, and forests around the existing architecture that are easily misclassified. The final architecture classification area was larger than the actual area with a mapping accuracy of 59.20%.

5. Water: Misclassified as architecture (29.63%), farmland (7.41%), and forest (3.70%) because pond stems and river embankments are easily assigned to architecture, farmland, and forest. Because the RS data were acquired during the dry season (April), some river beaches are mistakenly classified as architecture. The final classification area of the water was smaller than the actual area with a mapping accuracy of 59.26%.

Evaluation of pixel-based and object-oriented classification methods
In this study, the evaluation samples were selected by systematic sampling, which is more flexible and objective than the traditional method of obtaining evaluation samples. This method was used to calculate the overall accuracy and kappa coefficient of three pixel-based classification methods (MLC, NN, and SVM); the findings for each pixel-based method were compared with those of the object-oriented classification method. The results are shown in Table 9. The parameters of NN and SVM are shown in Figs 9 and 10. In summary, the overall accuracy of the object-oriented classification method ranged from 86.20% to 89.08% and the kappa coefficient ranged from 0.77 to 0.82, showing a good classification result. The main reason for this outcome is that the object-oriented method avoids the excessive confusion that typically occurs between the following combinations of resource types: forest and grassland/shrub; grassland/shrub and farmland; and water, forest, and grassland/shrub. The spectral features of forest and grassland/shrub did not differ significantly; consequently, the pixel-based classification methods, which rely mainly on spectral features, tended to assign both to one category, resulting in a low accuracy of forest classification. The same problem also occurred between grassland/shrub and farmland as their spectral features were similar; thus, it was difficult to make a fine distinction between them. In addition, because of mountain shadows, the features within the shaded regions were similar to those of water; thus, pixels in the shaded areas were falsely classified as water.
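To make the reliance of pixel-based methods on spectral features concrete, the following is a minimal sketch of Gaussian maximum likelihood classification with equal class priors. It is not the ENVI implementation used in the study, and the class means, band count, and function names are synthetic assumptions.

```python
import numpy as np

def fit_mlc(samples):
    """samples: dict of class name -> (n_pixels, n_bands) training array."""
    model = {}
    for name, x in samples.items():
        cov = np.cov(x, rowvar=False)
        # Store the mean, inverse covariance, and log-determinant per class.
        model[name] = (x.mean(axis=0), np.linalg.inv(cov), np.linalg.slogdet(cov)[1])
    return model

def predict_mlc(model, pixels):
    """Assign each pixel (row of spectral values) to the most likely class."""
    names = list(model)
    scores = []
    for name in names:
        mean, inv_cov, logdet = model[name]
        d = pixels - mean
        mahalanobis = np.einsum("ij,jk,ik->i", d, inv_cov, d)
        # Gaussian log-likelihood up to a constant, assuming equal priors.
        scores.append(-0.5 * (logdet + mahalanobis))
    best = np.argmax(np.stack(scores), axis=0)
    return [names[i] for i in best]

# Synthetic 4-band training pixels (made-up reflectances).
rng = np.random.default_rng(1)
training = {
    "forest": rng.normal([0.04, 0.07, 0.04, 0.42], 0.02, size=(300, 4)),
    "grassland/shrub": rng.normal([0.05, 0.08, 0.05, 0.38], 0.02, size=(300, 4)),
    "water": rng.normal([0.06, 0.07, 0.05, 0.03], 0.02, size=(300, 4)),
}
model = fit_mlc(training)
labels = predict_mlc(model, np.array([[0.04, 0.07, 0.04, 0.42],
                                      [0.06, 0.07, 0.05, 0.03]]))
print(labels)
```

In this synthetic setup the forest and grassland/shrub means are deliberately close, so pixels drawn between them are assigned almost arbitrarily; a classifier of this kind sees only the band values of each pixel and has no access to shape or texture cues.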

Discussion and conclusion
To select the most suitable classification method for scenic resource identification, four classification methods were considered: one object-oriented and three pixel-based (MLC, NN, and SVM). The results showed that the scenic resources of the three landforms met the requirements of classification accuracy using MLC, NN, SVM, and object-oriented classification and that object-oriented classification had higher accuracy and was more suitable for the systematic and high-precision identification of scenic resources in GF-2 images. The method for scenic resource identification based on GF-2 images was finalized as follows. First, a decision tree model was used to construct a scenic resource identification system, which was in turn used as classification criteria for the identification of five types of scenic resources: forest, grassland/shrub, farmland, architecture, and water. Land use raster map and forest resource planning and design survey data were then incorporated to construct a scenic resource database in GIS and the spatial analysis functions of GIS were used to refine the classification and identification of scenic resources [45]. This approach greatly increases the precision of scenic resource identification and makes full use of raster and vector data such as GF-2 images, digital elevation models, and forest resource planning and design survey data. This method will aid in the digitalization and systemization of scenic resource identification.
Pixel-based classification primarily analyzes the spectral features of pixels during classification. However, high-resolution RS images usually contain few bands and limited spectral information; thus, their classification should rely not only on the spectral information of pixels but also on spatial feature information such as shape and texture. Pixel-based classification therefore fails to fully utilize the rich spatial data of high-resolution images, which wastes spatial data resources to a certain extent, thereby lowering the classification accuracy and reducing the method's effectiveness in terms of classification and recognition [46].
In comparison, the object-oriented classification method overcomes some defects of pixel-based classification because it combines the spectral, textural, and morphological characteristics of scenic resources in the process of classification: it groups pixels with the same characteristics into patches by means of clustering and then classifies each patch according to the characteristics of the scenic resources. This process avoids the "salt-and-pepper" effect of pixel-based classification and increases the separability between the resource combinations of forest and grassland/shrub; grassland/shrub and farmland; and water, forest, and grassland/shrub, effectively improving the overall classification accuracy. Moreover, when used to process high-resolution RS images, object-oriented classification can compensate for the disadvantage of unstable spectral recognition in such images and fully utilize their rich spatial data.
However, although the object-oriented classification method improves the classification accuracy, it does not completely address all the problems of pixel-based classification methods. For example, more restrictive factors need to be added to the classification to improve the separability among scenic resources, which is a priority for future research. Moreover, the object-oriented classification method places higher demands on operators, who must not only master the geometric, structural, and spectral characteristics of the scenic resources to be identified but also establish the corresponding classification rules and select effective combinations of feature parameters to improve the classification accuracy. As a result, the actual time needed for classification is longer than that needed for pixel-based classification.
Therefore, it can be concluded that pixel-based and object-oriented classification methods have different scopes of application and should be chosen according to the actual situation and needs of scenic resource surveys. If multi-band, hyperspectral RS images are used, or if rapid screening of scenic resources is required, pixel-based classification can be used and then combined with historical information to identify key survey areas and field survey routes in GIS. Detailed identification can then be carried out using field survey data combined with visual interpretation. Alternatively, for the systematic and high-precision identification of scenic resources in high-resolution RS images, object-oriented classification may be used. The scenic resource identification system proposed in this study can be employed to establish classification rules, and effective feature parameter combinations can be selected to improve classification accuracy. The GF-2 images of the Baili Gorge, Longmen Tianguan, and Yugu Cave scenic areas are suitable for the object-oriented classification method, which has the highest classification accuracy and is the most efficient.
In the past few years, the use of high-resolution RS images as data sources and that of 3S technology to classify scenic resources have attracted the attention of the geographic information and landscape architecture research communities. The classification of scenic resources can be combined with 3S technology to identify and highlight the spatial combination information of scenic resources. This study proposes a scientific and effective method, using GF-2 images as the data source, as well as pixel-based (MLC, NN, and SVM) and object-oriented classification methods, to identify scenic resources and introduces systematic sampling methods to evaluate classification accuracy. The results show that the object-oriented classification has the highest accuracy in identifying scenic resources among the four classification methods.
Because pixel-based and object-oriented classification rest on different principles, the scientific comparison of their classification accuracy has consistently been called into question in geographic information research and application. This study proposes a systematic sampling method that uses the same evaluation indices and the same accuracy evaluation samples in ArcGIS to evaluate both pixel-based and object-oriented classifications of scenic resources, which effectively reduces the discrepancies introduced by the separate accuracy evaluation tools of the recognition software (ENVI, eCognition). Hence, this evaluation method can effectively improve the accuracy and objectivity of comparisons between pixel-based and object-oriented classification accuracy.
The scenic resource identification system constructed in this study is based on a decision tree model and can be used to establish object-oriented classification guidelines. Moreover, it is suitable for the efficient identification of five types of scenic resources, namely, forest, grassland/shrub, farmland, architecture, and water; thus, the classification accuracy and speed can be effectively improved.
After field investigation and verification of scenic resources in Yesanpo National Park, it was found that the scenic resource data identified by this method better reflect the actual situation of the resources in the park; therefore, this method is suitable for the future application of scenic resource identification in natural national parks. This method shows promise for building a scenic resource GIS database and more efficiently managing scenic resource text, charts, and other data. In the future, a variety of spatial analysis tools can be used for the spatial analysis and evaluation of scenic resource data. Therefore, promoting the standardization and informatization of scenic resource identification and evaluation will help improve their accuracy and efficiency [47], effectively providing planners and managers with relevant data for development and environmental protection decision-making. It can also provide basic data and decision-making support for the scientific planning of the district and the effective management of the national park.
One limitation of the current study was that the analyzed data were specific to the month of April. Thus, we need to select and assess the national park RS data for all four seasons to compare and identify changes in the combination of scenic resources in different seasons, thereby fully exploring the value of scenic resources. With the continuous development of 3S technology, we will continue to work toward introducing improved technologies, including higher resolution RS images and drone tilt photography, to the scenic resource identification system to meet the needs of national parks with unique planning and management needs.